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Abstract. In this paper, we propose a new construction of constant- 
degree expanders motivated by their application in P2P overlay networks 
and in particular in the design of robust trees overlay. 
Our key result can be stated as follows. Consider a complete binary tree 
T and construct a random pairing II between leaf nodes and internal 
nodes. We prove that the graph Gn obtained from T by contracting all 
pairs (leaf-internal nodes) achieves a constant node expansion with high 
probability. The use of our result in improving the robustness of tree 
overlays is straightforward. That is, if each physical node participating 
to the overlay manages a random pair that couples one virtual internal 
node and one virtual leaf node then the physical-node layer exhibits a 
constant expansion with high probability. We encompass the difficulty 
of obtaining this random tree virtualization by proposing a local, self- 
organizing and churn resilient uniformly-random pairing algorithm with 
0(log 2 n) running time. Our algorithm has the merit to not modify the 
original tree virtual overlay (we just control the mapping between physi- 
cal nodes and virtual nodes). Therefore, our scheme is general and can be 
applied to a large number of tree overlay implementations. We validate 
its performances in dynamic environments via extensive simulations. 



1 Introduction 

Background and Motivation P2P networks are appealing for shar- 
ing/diffusing/searching resources among heterogeneous groups of users. Effi- 
ciently organizing users in order to achieve these goals is the main concern that 
motivated the study of overlay networks. In particular, tree overlays recently be- 
comes an attractive class of overlay networks because efficient implementations 
of various communication primitives in P2P systems tied to the hierarchical and 
acyclic properties of trees such as content-based publish/subscribe or multicast 
([4, 6, 9, 14]). Many P2P and distributed variants of classical tree structures such 
as B-trees, R-trees or P-trees have been designed so far [1, 5, 7, 8, 17, 18]. 
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Because of the dynamic nature of P2P networks, robustness to faults and 
churn (e.g., frequent node join and leave) is indispensable for the service on the 
top of them to function properly. A recent trend in measuring the robustness 
of overlay networks is the evaluation of graph expansion. The (node) expansion 
h(G) of an undirected graph G = (Vg,Ec) is defined as: 
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where OS is the set of nodes that are adjacent to a node in S but not contained in 
S. The implication of node expansion is that the deletion of at least h(G)-k nodes 
is necessary to disconnect a component of k nodes in G. That is, graphs with 
good expansion are hard to be partitioned into a number of large connected 
components. In this sense, the expansion of a graph can be seen as a good 
evaluation of its resilience to faults and churn. Interestingly, the expansion of tree 
overlays is trivially 0(l/n), which is far from adequate. This weakness to faults 
is the primary reason why tree overlays are not pervasive in real applications. 

Our focus in this paper is to provide the mechanism to make tree over- 
lays robust and suitable to real applications. In particular, we are interested in 
generic schemes applicable to a large class of tree overlays with minimal extra 
cost: As seen above, there are many variations of tree-based data structures with 
distinguished characteristics, but their distributed implementations always face 
the problem how to circumvent the threat of disconnection. Therefore, provid- 
ing such a generic robustization scheme would offer the substantial benefit for 
implementing distributed tree-based data structures in a systematic way. 

Our contribution Solutions for featuring P2P tree overlays with robustness range 
from increasing the connectivity of the overlay (in order to eventually mask the 
network churn and fault) to adding additional mechanism for monitoring and 
repairing the overlay. However the efficiency of these techniques is shadowed by 
the extra-cost needed to their implementation in dynamic settings. Moreover, 
the design of those mechanisms often depends on some specific tree overlay 
implementation, and thus their generalization is difficult. Therefore, we propose 
a totally novel approach that exploits the principal of tree virtualization. That 
is, in a tree overlay one physical node may be in charge of several virtual nodes. 
The core of our approach is to use this mapping between virtual and physical 
nodes such that the physical-node layer exhibits a good robustness property. 

Our primary contribution is the following theorem which is the key in the 
construction of our random virtualization scheme: 

Theorem 1. Let T be a complete n-node binary tree with duplication of the root 
node (the duplicated root is seen to be identical to the original root). Then, we 
can define a bijective function II from leaf nodes to internal nodes. Let Gn be 
the graph obtained from T by contracting pair (v,II(v)) for all v 3 . Choosing II 
uniformly at random Gn has a constant (node) expansion with high probability. 



3 The contraction of (v, II (v)) means that we contract edge {v, II (v)} as if it exists in 
T. 



An immediate consequence of this theorem is that the physical-node layer 
achieves a constant expansion with high probability if a random chosen couple 
composed of one leaf and one internal node is assigned to each physical node. It 
should be noted that our random tree virtualization does not modify the original 
properties of the tree overlay since we only control the mapping to physical nodes. 
This feature yields a general applicability of our result to a large class of tree 
overlay implementations. 

The above result relies on the uniform random bijection (i.e. random perfect 
bipartite matching) between internal and leaf nodes in the tree overlay. There- 
fore, in order to prove the effectiveness of our proposal in a P2P context we also 
addressed the construction of random perfect bipartite matching over internal 
and leaf nodes. Interestingly, we can propose a local and self-organizing scheme 
based on the parallel random permutation algorithm by Czumaj et. al.[ll]. Our 
scheme allows us to increase the graph expansion to a constant within 0(log 2 n) 
synchronous rounds (n is the number of physical nodes). The quick convergence 
of our scheme in dynamic settings is validated through extended simulations. 

Roadmap In Section 2, we introduce the relate work mainly in the field of dis- 
tributed computing. Section 3 presents the proof of our main result. The issue 
about the distributed implementation of our scheme is explained in Section 4 
which includes the simulation result. Finally, Section 5 provides the conclusion 
and future research issues. 

2 Related works 

Expander graphs have been studied extensively in many areas of theoretical 
computer science. A good tutorial can be found in [21]. In the following we 
restrict our attention to distributed constructions with a special emphasize on 
specific P2P design. 

There are several results about expander construction in distributed settings. 
Most of those results are based on the distributed construction of random reg- 
ular graphs, which exhibit a good expansion with high probability. To the best 
of our knowledge one of the first papers that addressed expander constructions 
in peer-to-peer settings is [19]. The authors compose d Hamiltonian cycles to 
obtain a 2d-regular graph. In [20] the authors propose a fault-tolerant expander 
construction using a pre-constructed tree overlay. It provides the mechanism to 
maintain an approximate random d-regular graph under the assumption that the 
system always manages a spanning tree. The distributed construction of random 
regular graphs based on a stochastic graph transformation is also considered 
in [10,15]. They prove that repeating a specific stochastic graph modification 
(e.g., swapping the two endpoints of a length-three path) eventually returns 
a uniformly-random sampling of regular graphs. Since all the previously men- 
tioned algorithms are specialized in providing good expansion, the combination 
with overlays maintenance is out of their scope. Therefore, these works cannot 
be easily extended to a generic fault tolerant mechanism in order to improve the 



resiliency of a distributed overlay. Contrary to the previous mentioned works, 
our study can be seen as a way of identifying implicit expander properties in 
a given topological structure. There are several works along this direction. In 
[12] the authors propose a self-stabilizing constructions of spandcrs, which are 
spanning subgraphs including smaller number of edges than the original graph 
but having the asymptotically same expansion as the original 4 . Abraham et.al. 
[2] and Aspnes and Wieder [3] respectively give the analysis of the expansion 
for some specific distributed data structures (skip graphs and skip b-trees). Re- 
cently Goyal et.al. [16] prove that given a graph G, the composition of two 
random spanning trees has the expansion at least £2 (h(G) / log n), where n is the 
number of node in G. We can differentiate our result from the above works by 
its generality and the novelty of random tree virtualization concept. 

3 The expander property of Gn 

3.1 Notations 

For an undirected graph G, Vq and Eq respectively denote the sets of all nodes 
and edges in G. Given a graph G and a subset of nodes S C Vq, we define Ind(S) 
to be the subgraph of G induced by S. For a set of nodes S, its complement is 
denoted by S. The node boundary of a set S C Vq is defined as a set of nodes in 
S that connect to at least one node in S, which is denoted by dS. 

Let T = (Vt, Et) be a binary tree. The sets of leaf nodes and internal nodes 
for T are respectively denoted by L(Vr) and I(Vr)- Given a subset S C Vt, we 
also define L(S) = L(V T ) n S and I(S) = I(V T ) n S. For a (sub)tree X, the root 
node of X is denoted by r(X), and the parent of r(X) is denoted by p(X). 

3.2 Preliminary results 

In the following T denotes a complete binary tree without explicit statement. 
The root of T is denoted by r(T). We prove several auxiliary results that will be 
further used in our main result. 

Lemma 1. For any nonempty subset S C I(Vr) such that Ind(5) is connected, 
\dS\ > \S\ + 1. In particular, if r(T) g S holds, \dS\ > \S\ + 2. 

The above lemma can be generalized for any (possibly disconnected) subset 
S C I(V T ). 

Lemma 2. Given any nonempty subset S C I(Vt) \ {r(T)} such that Ind(S) of 
T consists of m connected components, \dS\ > \S\ + m + 1 holds. 

The following corollary is simply deduced from Lemma 2. 

Corollary 1. Let X be a subtree of T . For any subset S C I{V X ), \dSDV x \ > 
\S (~1 Vx | • In particular, if S is nonempty, we have \dS fl Vx \ > \S fl Vx \ + 1. 



4 A spander is also called a sparsifier. 



3.3 Main Result 

In what follows, |L(Vr) is denoted by n for short (i.e., n is the number of nodes 
in Gn)- We also assume 77 is a bijective function from leaf nodes to the set 
of internal nodes (where the root doubly appears), which is chosen from all n! 
possible functions uniformly at random. For a subset of nodes S C L(Vr), we 
define 77(5) = {n(u)\u e S} and Q n = S U 77(S). 

Provided a subset S C L(Vt) satisfying \S\ < n/2, we say a subtree X is 
S-occupied if all of its leaf nodes belong to S. A S-occupied subtree X is maximal 
if there is no S-occupied subtree X containing X as a subtree. Note that two S- 
occupied maximal subtrees X\ and X 2 in a common tree T are mutually disjoint 
and p{X\) ^ p(X 2 ) holds because of their maximality. We first show two lemmas 
used in the main proof. 

Lemma 3. Let X be a maximal S-occupied subtree for a nonempty subset S C 
L(V T ). Then, \(dQ n )nV x \ > \Q^nV x \/2 holds. 

Lemma 4. Given a subset S C L(Vr) such that \S\ < n/2, let X , Xi, ■ ■ ■ Xk be 
all maximal S-occupied subtrees and Vx = U* =1 Vx t ■ For any a < 1, Pr(|77(5 l ) n 



The implication of the above two lemmas is stated as follows: We are focusing 
on a subset of boundary nodes dQn that are associated with some "hole" (that is, 
the set of nodes not contained in Qn) in S'-occupied subtrees. Lemma 3 implies 
that at least half of the nodes organizing the hole belongs to dQn- Lemma 4 
bounds the probability that S'-occupied subtrees has the hole with size larger or 
equal to (1 — a)\I(Vx)\- We also use the following inequality: 

Fact 1 (Jensen's inequality) Let / be the convex function, pi,^,"' be a 
series of real values satisfying YmLiPi — L an d X\,X2, ••• be a series of real 
values. Then, the following inequality holds: 



We give the proof of the main theorem (the statement is refined) . 

Theorem 1. The node expansion ofGn is at least with probability 1 — o(l). 

Proof. To prove the lemma, we show that with high probability, |c?*S'| > |S|/480 
holds for any subset S C V GiT such that \S\ < n/2. In the proof, we identify 
L(T) and Va n as the same set, and thus we often refer S as a subset of L(Vr) 
without explicit statement. Given a set S, let k be the number of maximal S'- 
occupied subtrees, X = {Xi,X 2 , ■ ■ • , Xk} be all maximal S'-occupied trees, and 
V x = Ui =1 V Xi . We also define P = {p(X i )\X l € X}. Throughout this proof, 
we omit the subscript 77 of Qn- For a subset Y C Vr, let Q(Y) = Q n Y and 
q(Y) = \Q(Y)\ for short. 
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Fig. 1: Illustration of the set (dS) fl Vx, dQ(Vx), and their boundaries, in the 
proof of Theorem 1. 



The goal of this proof is to show that |<9Q| > 151/240 holds with high proba- 
bility in T for any given S. It follows that \dS\ > |5|/480 holds in Gn- To bound 
\dQ\, we first consider the cardinality of two subsets (dQ) f) Vx and d(Q D Vx) 
(= dQ(Vx))- See Figure 1 for an example. While these subsets are not mutually 
disjoint, only the roots of S'-occupied subtrees can be contained in dQ(Vx)- It 
implies 

\(dQnv x )ndQ(Vx~)\<q(P). (l) 

Lemma 2 and 3 lead to the following inequalities: 

k 

J' 



\(dQ)nVx\>l(j2\Qnv Xt \j 

>(\S\-k-q(Vx))/2. (2) 



\d(Q(V x )\ > \Q(Vx)\ + l 

>(\S\-l)-q(Vx) + l = \S\-q(V X ). (3) 

By inequalities 1, 2, and 3, we can bound the size of dQ as follows: 

\dQ\ > \(8Q) n V x \ + \dQ(Vx~)\ - q(P) 

> (\S\ -k- q(V x ))/2 + \S\ - q(V x ) - q(P) (4) 
>3(\S\-k-q(V x ))/2 + k-q(P). (5) 

We consider the following two cases according to the value of k: 
(Casel)/c> |5|/16: We show \dQ\ > |5|/240 holds for any 77. If q{P) < 12fe/13 
holds, we have \dQ\ > fc/13 > |5|/240 from inequality 5 because of q(Vx) < 
\S\-k. Furthermore, if \S\-q(V x )-q(P) > \S\/2A0, we have \dQ\ > |S'|/240from 
inequality 4. Thus, in the following argument, we assume q(P) > 12&/13 and 



\S\ - q(V x ) - q(P) < |5|/240. Consider the subgraph H of T induced by Q(P). 
We estimate the number of connected components in H to get a bound of \8Vh\ 
in T. Letting C — {C\, C2, C3, • • • C m } be the set of connected components of H, 
we associate each leaf node u in a 5-occupicd subtree Xi with the component in 
C containing p(Xi) (the node is associated with no component if p{Xi) is not in 
Q(P)). Since each node v G H has one child belonging to Vx, each component in 
H forms a line graph monotonically going up to the root. Thus, if a component 
Cj has j nodes U\, U2, • ■ • Uj, which are numbered from the leaf side, each node 
Uh (1 < h < j) has a child as the root of S'-occupicd trees having at least 2 h ~ 1 
leaf nodes (recall that T is a complete binary tree) . It follows that the number of 
nodes in S associated with C% is J2h=i 2 h ~ 1 > 2 J ' — 1. Letting k be the number 
of components in C consisting of i nodes, we have: 



where the second line is obtained by applying Jensen's inequality. To make the 
above inequality hold, the condition \S\/m < 120 m > |5|/120 is necessary. 
Next, we calculate how many nodes in BVh H Vx is occupied by Q. From the 
definition of H, Q(8Vh) docs not contain any node in P (if a node v G P is 
contained, it will be a member of Vr and thus not in OVr)- Thus, any node 
in dV H is a member of V x , Q n P, or Vx U P. Let Y = Q(V X U P n 
and y = \Y\ for short. Since any node in Q n P is not contained in Q and the 
cardinality of OVr U Vx can be bounded by g(P) (as the roots of 5-occupicd 
trees), we have the following bound from Lemma 2: 



\dQ\ > \dV H \ 0| 

> \(dv H ) n (Q n P)\ 

> \dV H \ - \(dv H )nv x \ - \(dv„) n (Vx~up)\ 

> \dV H \-q(P)-y 

> q{P)+m+l-q{P)-y 



The illustration explaining this inequality is shown in Figure 2. Since Y, 
Q(P) and Q(Vx) are mutually disjoint, we obtain y + q(Vx) + q(P) < l^l 
y <\S\- q(Vx) - q(P) < 151/240. Consequently, we obtain \dQ\ > \S\/240. 



(Case 2) k < Ld; In the following argument, given a set 5 satisfying k < |<5|/16, 
we bound the probability Pr(|<9Q| < |5|/32). From the inequality 5 and the fact 
of k-q(P) > 0, we have \dQ\ > 3(15|5|/16 - q(V x ))/2. To be \dQ\ < |5|/32, we 
need 3(15|5|/16 - q(V x ))/2 < \S\/32 <^ q(V x ) > 11|5|/12. Thus, from Lemma 
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4, we can bound the probability as follows: 

Pr(|3Q| < |5|/32) < Pr(|/7(S)| > 11\I{V X )\/12) 
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Fixing k and \S\, we look at the number of possible choices of S. Since we can 
determine a S'-occupied subtree Xi by choosing one node in T as its root, the 
set S is determined by choosing k nodes from all nodes in T. Thus, the total 
number of subset S organizing at most |S|/16 S'-occupied subtrees are bounded 

by El2i /16 (T) ^ W(|S|7ib)- Summin S U P this bound for an y \ S \ < n / 2 - The 
total number is bounded by Y^\s\=i W (\S\/io) ■ Using the union bound and a 
well-known bound (™) < (ne/m) m , we have: 

Pr I (J < |j 

\SCL(V T )\\S\<n/2 

n/2 2n \f |S| \ f\S\^ 11151/12 



= o(l). 



n 



All the details of the previous calculation are provided in the Appendix. The 
theorem is proved. □ 



4 Distributed Construction of G n 



To prove the impact of Theorem 1 in P2P settings we have to construct scalably 
a random bijection (i.e., random perfect bipartite matching) between internal 
and leaf nodes in tree overlays. In this section, we show that this distributed 
construction is possible with nice self-* properties. That is, our scheme is totally- 
distributed, uses only local information, and is self-healing in the event of nodes 
joins and leaves. In the following we state the computational model and our 
network assumptions. 

4.1 Computational Model 

We consider a set of virtual nodes (peers) distributed over a connected physi- 
cal network. Virtual nodes are structured in a binary tree overlay. Each phys- 
ical node managing a virtual node v can communicate with any physical node 
managing v's neighbors in the overlay. The communication is synchronous and 
round-based. That is, the execution is divided into a sequence of consecutive 
rounds. All messages sent in some round are guaranteed to be received within 
the same round. 

We assume that each physical node manages exactly one internal node and 
one leaf node in the virtual overlay. Moreover, we also assume that the tree 
is balanced. Note that these assumptions are not far from practice. Most of 
distributed tree overlay implementations embed balancing schemes. The preser- 
vation of matching structure is easily guaranteed by employing the strategy that 
one physical node always join as two new nodes. We can refer as an example the 
join/leave algorithm in [13], which is based on the above strategy and generally 
applicable to most of binary-tree overlays. 

4.2 Uniformly-Random Matching Construction 

The way leaf and internal nodes are matched via a physical node is generally 
dependent on the application requirements and is rarely chosen uniformly at 
random. That is, the implicit matching offered by the overlay may be extremely 
biased and the expansion factor computed in the previous section may not hold. 
Fortunately, the initial matching can be quickly "mixed" to obtain a uniformly- 
random matching. To this end we will extend the technique proposed by Czu- 
maj et. al.[ll] for fast random permutation construction to distributed scalable 
matchings in tree overlays. The following stochastic process rapidly mixes the 
sample space of all bipartite matchings between leafs and internal nodes: 

1. Each leaf node first tosses a fair coin and decides whether it is active or 
passive. 

2. Each active node randomly proves a leaf node and sends a matching-exchange 
request. 

3. The passive node receiving exactly one matching-exchange request accepts 
it, and establishes the agreement to the sender of the request. 



4. The internal nodes managed by the agreed pair are swapped. 

Note that the above process is performed by all leaf nodes concurrently in 
a infinite loop. Following the analysis by Czumaj et. al.[ll], the mixing time of 
the above process is O(logn). 

The only point that may create problems in distributed P2P settings is the 
second step. For that point, we can propose a simple solution to implement the 
random sampling mechanism with O(logn) time and message complexity. The 
algorithm is as follows: First, the prober sends a token to the root. From the root, 
the token goes down along the tree edges by selecting with equal probability one 
of its children. When the token reaches a leaf node, the destination is returned 
as the probe result. 

Overall, the distributed scalable algorithm for constructing a random bipar- 
tite matching takes 0(log 2 n) time. In the following subsection, we evaluate the 
performances of the above scheme face to churn. 

4.3 Experimental Evaluation in Dynamic Environment 

In this section, we experimentally validate the performance of our approach by 
simulation. In the simulation scenario the following four phases are repeated. 

Node join We assume that a newly-joining node knows the physical address 
of some leaf node u in the network. Let v and v' be the leaf and internal 
nodes that will be managed by the newly-joining physical node. The node u 
is replaced by a newly internal node v'. Then v and u becomes children of 
v'. 

Node leave The adversary chooses a number of nodes to make them leave. 
Since it is hard to simulate worst-case adversarial behavior, we adopt a 
heuristic strategy: given a physical node v with leaf vl and internal node 
vi, let h(v) be the height of the smallest subtree containing both vl and Vj. 
Intuitively, the physical node v with higher h(v) has much contributition for 
avoiding the node boundary to be contained in a small subtree containing 
vl. Following this intuition, the adversary always makes the node v with 
highest h(v) leave. 

Balancing Most of tree-based overlay algorithms have some balancing mecha- 
nism. While the balancing mechanism has a number of variations, we simply 
assume a standard rotation mechanism. After a number of node joins and 
leaves, the tree is balanced by standard rotation operation. 

Matching Reconstruction We run once the matching-mixing process de- 
scribed in the previous section. 

Since exact computation of node expansion is coNP-complete, we monitor the 
second smallest eigenvalue A of the graph's Laplacian matrix, which has a strong 
corelation to the node expansion: a graph with the second smalles eigenvalue A 
is a A/2-expander. In the following we propose our simulations results first in a 
churn free setting then in environments with different churn levels. Due to the 
space limitation the churn-free simulations are defered to the Anexe section. 



Without churn We ran 100 simulations of 100 rounds with 512 nodes and no 
churn. The value of A is calculated at the begin of each round. Those simulations 
tends to emphasize what is "expectable" from the mixing protocol and some of 
its dynamic properties. 

A varies from 0.263 to 0.524 with an average value of 0.502 and a standard 
deviation of 0.017. The low standard deviation and the closeness of average and 
maximum reached values of A indicates that the minimum is rarely reached. 
Basically it is obtained when most nodes become responsible of internal nodes 
that are "close" from their leaves. Intuitively if each node is responsible of an 
ancestor of its leaf, there is no additionnal links between the left and the right 
subtrees of the root. In that case we do not take benefit of mixing and get bad 
expansion properties inherited from tree structures. 

Churn prone environments. We ran 100 simulations of 100 rounds with 512 
nodes and a given rate of churn. Time is divided in seven rounds groups. During 
the first round a given percentage of new nodes join the system. During the 
second a given percentage of nodes leave the system. During the third round the 
tree is balanced and the mixing protocol is run. During other rounds the mixing 
protocol is run. A is measured at the end of each round. Nodes gracefully leave 
the system. Those simulations investigate the impact of churn on A and how fast 
our mixing protocol restores a stable configuration. 
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Fig. 3: A over time with 10% of churn 

Figure 3 shows the evolution of A over time in presence of 10% of churn (10% 
is relative to the initial number of nodes). Each step stands for a round. From a 
stable configuration where A oscillates between 0.52 and 0.48, it drops down to 
0.4 every seven rounds due to arrivals and departures. The structure is sensitive 
to churn in the sense that it significantly decreases the value of A. But on the 
other hand, proposed mixing protocol converges fast. It needs two rounds to 
reach the average "expectable" value of A. 

Figure 4 shows the evolution of A over time in presence of 30% of churn 
(30% is relative to the initial number of nodes). Each step stands for a round. 
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Fig. 4: A over time with 30% of churn 

Basically churn increases stretches the curve; down part are downer around 0.3, 
high values are quite stable around 0.45 and fluctuations are widespread. While 
the previous one 3 gives a good overview of the global behaviour of the protocol 
facing churn, this figure emphases some interesting details. First it emphasizes 
that mixing protocol is not monotonic; it might decrease A. Second, the impact 
of arrivals and departures are distinct. Moreover the magnitude of their impact 
is not predictable because the selection of bootstrap node is random. Starting 
from a stable situation arrivals will always decrease A. With the proposed join 
mechanism, a new comer is weakly connected to the rest of the system. Starting 
from a stable situation gracefull departures will almost always decrease A. But 
with the proposed leave mechanism their impact is more subtle because; they 
can largely modify the tree balance which also implies links exchanges. In some 
rare cases those exchanges (which could be thought as side effect shuffles of links) 
or the departure of weakly connected nodes could increase A. 

5 Concluding Remarks 

We proposed for the first time in the context of overlay networks a generic scheme 
that transforms any tree overlay in an expander with constant node expansion 
with high probability. More precisely, we prove that a uniform random tree vir- 
tualization yields a node expansion at least with probability 1 — o(l). Second, 
in order to demonstrate the effectiveness of our result in the context of real P2P 
networks we further propose and evaluate in different churn scenario a simple 
scheme for uniformly random tree virtualization in 0(log 2 n) running time. Our 
scheme is totally distributed and uses only local information. Moreover, in the 
event of nodes join/leave or crash our scheme is self-healing. 

The virtualization scheme itself is a promising approach and provides several 
interesting questions. We enumerate the open problems related to our result: 

— Better analysis of matching convegence: As the simulation result ex- 
hibits, the convergent value of expansion computed from A is considerably 
larger than the theoretical bound. In addition, the convegence time is also 




faster than the theoretical bound. Finding improved bounds for both of the 
expansion and convegence time is an open problem. 
— Effective utilization of expander propoerty: In addition to fault re- 
siliency, expander graphs also offer the rapidly-mixing property of random 
walks on the graph. That is, MCMC-like sampling method effectively runs 
on our scheme. It is an interesting research direction that we utilize expan- 
sion property for implementing some statistical operation over distributed 
datas or query load balancing. 

Application of virtualization scheme to other overlays: The virtu- 
alization scheme is simple and generic, and thus we can apply it to other 
well-known overlay algorithms such as Chord or Pastry. Clarifying the class 
of overlay networks where the virtualizaton scheme efficiently works is a 
challenging problem. 
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6 Omitted proofs 



We explain the proof details omitted in the main body. 

6.1 Proof of Lemma 1 

Proof. We prove the case of r(T) $ S by induction on the cardinality of S. 
(Basis) If 1 5 1 = 1, the lemma clearly holds because every internal node except 
for the root of T has degree three. (Inductive step) Suppose as the induction 
hypothesis that \dS\ > |5| + 2 holds for any connected set S of size k. we 
prove that for any v e J(Vr) that is connected to a node in S, \d(S U {v})\ > 
\S U {v}\ + 2 holds. For short, let 5" = S U {v}. Since S is connected, v is 
connected to exactly one node in S. In other words, node v has two neighbors 
of v in neither S nor OS, which are elements of dS' . In contrast, v is an element 
of dS but not in dS'. It follows that <9S" > dS — 1 + 2 holds. From the induction 
hypothesis, we obtain OS' > dS + 1 > \S\ + 1 + 2 = |5"| + 2. The case r(T) G S 
is obviously deduced from the case of r(T) S 1 . The lemma is proved. □ 

6.2 Proof of Lemma 2 

Proof. Let Ci, C2, • • • Cj, • • • C m be the set of connected components in Ind(S'). 
In the case of r(T) e S, we assume r(T) e C\ without loss of generality. We 
prove the lemma by induction on j . (Basis) It holds from Lemma 1 (Inductive 
step) Suppose \dS\ > \S\ + j + 1 as the induction hypothesis. Consider adding 
a new component Cj+i into S. Let c be the number of nodes in Cj+\. Since 
r(T) Cj+i, from Lemma 1, \dVc m+1 \ > c + 2 holds. At most one node is shared 
by dS and dV Cm+1 ,wehave\dSuV Cj+1 \ > |5|+j + l+c+2-l > {\S\+c)+{j+l). 
The lemma is proved. □ 

6.3 Proof of Lemma 3 

Proof. We omit the subscript 77 of Qn for short. We divides Q fl Vx into three 
mutually-disjoint subset Si, S2, and ^3: Let Si C VxCiQ be the set of nodes that 
have no neighbor belonging to QnVx, S£_= dSlr\Q, and ^3" = (QnVx)\(^iU^) 
(see Figure 5). Since X is ^-occupied, S2 consists only of internal nodes. Thus, 
from Corollary 1, j^l = |<9Si| > |Si| holds. By the definition of S2 and ^3, 
52 C (dQ) n V x and ^3 C (9Q) n Vx hold. Consequently, we have 

2\dQnV x \ > 2(\!h\ + \S^\) 

> |S2|+ 21^1 + 1^1 

> IQnVxl- 



The lemma is proved. 



□ 



: node in Q 



O : node not in Q 



Fig. 5: Illustration of Si, S2, and S3 in the proof of Lemma 3. 



6.4 Proof of Lemma 4 



Proof. Let Z = I(Vx) for short. First, we fix a subset 5" C of S with cardinal- 
ity ot\Z\, and consider the probability that II(S') C Z holds. Without loss of 
generality, we can regard 77 as a permutation on Z whose first |S| elements are 
mapped to Z . Thus, to compute the probability, it is sufficient to count the num- 
ber of permutations where any element in 5" appears at the first \Z\ elements 
of 77. Dividing the counted number by n\, the probability can be calculated as 
follows: 




\Z\(\Z\-l)(\Z\-2)---(\Z\-a\Z\ + l) 
n(n-l)(n-2)---(n-a\Z\ + l) 



< 




Using the union bound, we can obtain the following bound: 



Pr(77(S*) > a\Z\) < Pr( (J S' C Z) 



S'CS\S'=a\Z\ 




Since \Z\ = \S\ — k holds, the lemma is proved. 



□ 



6.5 The Calculation Details in the Proof of Theorem 1 

We describe the detailed calculation to lead the last inequality in the proof of 
Theorem 1. 




Using the bound < (ne/m)' m and the condition k < |5|/16, 




By numeric calculation, we have log((32e) 1 / 16 (192e/17) 17 / 192 ) < 0.841 and 
(11/12-1/16) > 0.854. Thus, 
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